Low-complexity and efficient classification of voiced/unvoiced/silence for noisy environments
نویسندگان
چکیده
This paper describes a low-complexity and efficient speech classifier for noisy environments. The proposed algorithm utilizes the advantage of time-scale analysis of the Wavelet decomposition to classify speech frames into voiced, unvoiced and silence classes. The classifier uses only one single multidimensional feature which is extracted from the Teager energy operator of the wavelet coefficients. The feature is enhanced and compared with quantile-based adaptive thresholds to detect phonetical classes. Furthermore, to save memory, the adaptive thresholds are replaced by a slope tracking method on the filtered feature. These algorithms are tested with the TIMIT database and additive white, car, factory noise, and compared with other methods to demonstrate their superior performance and robustness.
منابع مشابه
Denoising Of Speech Signal By Classification Into Voiced, Unvoiced And Silence Region
In this paper, a speech enhancement method based on the classification of voiced, unvoiced and silence regions and using stationary wavelet transform is presented. To prevent the quality of degradation of speech during the denoising process, speech is first classified into voiced, unvoiced and silence regions. An experimentally verified criterion based on the short time energy process has been ...
متن کاملVoiced/ Unvoiced Speech Discrimination Using Symbolic Dynamics
The aim of this study is to evaluate how far the nonlinear symbolic dynamics approach helps to characterize the nonlinear properties of speech and thereby discriminate between voiced and unvoiced speech segments. The symbolic dynamics calculations were performed on voiced speech, unvoiced speech and silence data. Differences were found in histogram properties and complexity measures of symbol s...
متن کاملVoiced - Unvoiced - Silence Classification via Hierarchical Dual Geometry Analysis
The need for a reliable discrimination among voiced, unvoiced and silence frames arises in a wide variety of speech processing applications. In this paper, we propose an unsupervised algorithm for voiced-unvoiced-silence classification based on a time-frequency representation of the measured signal, which is viewed as a data matrix. The proposed algorithm relies on a hierarchical dual geometry ...
متن کاملRobust automatic continuous-speech recognition based on a voiced-unvoiced decision
In this paper, the implementation of a robust front-end to be used for a large-vocabulary Continuous Speech Recognition (CSR) system based on a Voiced-Unvoiced (V-U) decision has been addressed. Our approach is based on the separation of the speech signal into voiced and unvoiced components. Consequently, speech enhancement can be achieved through processing of the voiced and the unvoiced compo...
متن کاملA Pattern Recognition Approach to Voiced — Unvoiced — Silence Classification with Applications to Speech Recognition
Absb-act—In speech analysis, the voiced-unvoiced decision is usually performed in conjunction with pitch analysis. The linking of voiced-unvoiced (V-UV) decision to pitch analysis not only results in unnecessary complexity, but makes it difficult to classify short speech segments which are less than a few pitch periods in duration. In this paper, we describe a pattern recognition approach for d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006